4 research outputs found

A Feature Analysis for Multimodal News Retrieval

Authors: Tahmasebzadeh Golsa; Hakimov Sherzod; Müller-Budack Eric; Ewerth Ralph
Publication venue: Aachen : RWTH
Publication date: 01/01/2020

Content-based information retrieval relies on the information contained in documents rather than on metadata such as keywords. Most information retrieval methods are based on either text or images. In this paper, we investigate the usefulness of multimodal features for cross-lingual news search in various domains: politics, health, environment, sport, and finance. To this end, we consider five feature types for image and text and compare the performance of the retrieval system using different combinations. Experimental results show that retrieval results can be improved when considering both visual and textual information. In addition, it is observed that among textual features entity overlap outperforms word embeddings, while geolocation embeddings achieve better performance among visual features in the retrieval task.

Repositorium für Naturwissenschaften und Technik

A Feature Analysis for Multimodal News Retrieval

Authors: Ewerth Ralph; Hakimov Sherzod; Müller-Budack Eric; Tahmasebzadeh Golsa
Publication date: 01/01/2020

arXiv.org e-Print Archive

Repositorium für Naturwissenschaften und Technik

Multimodal Geolocation Estimation of News Photos

Authors: Caputo Annalina; Crestani Fabio; Davis Brian; Ewerth Ralph; Goeuriot Lorraine; Gurrin Cathal; Hakimov Sherzod; Joho Hideo; Kamps Jaap; Kruschwitz Udo; Maistro Maria; Müller-Budack Eric; Tahmasebzadeh Golsa
Publication venue: Cham : Springer
Publication date: 17/03/2023

The widespread growth of multimodal news requires sophisticated approaches to interpret the content and relations of different modalities. Images are of utmost importance since they represent a visual gist of the whole news article. For example, it is essential to identify the locations of natural disasters for crisis management or to analyze political or social events across the world. In some cases, verifying the location(s) claimed in a news article might help human assessors or fact-checking efforts to detect misinformation, i.e., fake news. Existing methods for geolocation estimation typically consider only a single modality, e.g., images or text. However, news images can lack sufficient geographical cues to estimate their locations, and the text can refer to various possible locations. In this paper, we propose a novel multimodal approach to predict the geolocation of news photos. To enable this approach, we introduce a novel dataset called Multimodal Geolocation Estimation of News Photos (MMG-NewsPhoto). MMG-NewsPhoto is, so far, the largest dataset for the given task and contains more than half a million news texts with their corresponding images, out of which 3000 photos were manually labeled for the photo geolocation based on information from the image-text pairs. For a fair comparison, we optimize and assess state-of-the-art methods using the new benchmark dataset. Experimental results show the superiority of the multimodal models compared to the unimodal approaches.

Institutionelles Repositorium der Leibniz Universität Hannover

OEKG: The Open Event Knowledge Graph

Authors: Abdollahi Sara; Alves Diego; Amaral Gabriel; Cheema Gullal S.; Gottschalk Simon; Kacupaj Endri; Koutsiana Elisavet; Kuculo Tin; Major Daniela; Mello Caio; Sittar Abdul; Swati; Tahmasebzadeh Golsa; Thakkar Gaurish
Publication venue: Aachen, Germany : RWTH Aachen
Publication date: 01/01/2021

Accessing and understanding contemporary and historical events of global impact such as the US elections and the Olympic Games is a major prerequisite for cross-lingual event analytics that investigate event causes, perception and consequences across country borders. In this paper, we present the Open Event Knowledge Graph (OEKG), a multilingual, event-centric, temporal knowledge graph composed of seven different data sets from multiple application domains, including question answering, entity recommendation and named entity recognition. These data sets are all integrated through an easy-to-use and robust pipeline and by linking to the event-centric knowledge graph EventKG. We describe their common schema and demonstrate the use of the OEKG through three use cases: type-specific image retrieval, hybrid question answering over knowledge graphs and news articles, as well as language-specific event recommendation. The OEKG and its query endpoint are publicly available.

Repositorium für Naturwissenschaften und Technik